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PROVISIONAL SPECIFICATION 
for the invention entitled: 



A NOVEL GROWTH FACTOR AND A GENETIC SEQUENCE 
ENCODING SAME - II" 



The invention is described in the following statement: 
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A NOVEL GROWTH FACTOR AND A GENETIC SEQUENCE 
ENCODING SAME - II 

The present invention relates generally to an isolated molecule having vascular 
endothelial growth factor-like properties and to a genetic sequence encoding same. The 
molecule will be useful in the development of a range of therapeutics and diagnostics 
useful in the treatment, prophylaxis and/or diagnosis of conditions requiring enhanced 
or diminished vasculature and/or vascular permeability. 

Bibliographic details of the publications referred to by author in this specification are 
collected at the end of the description. Sequence Identity Numbers (SEQ ID NOs.) for 
the nucleotide and amino acid sequences referred to in the specification are defined 
following the bibliography. 

Throughout this specification, unless the context requires otherwise, the word 
"comprise", or variations such as "comprises" or "comprising", will be understood to 
imply the inclusion of a stated element or integer or group of elements or integers but 
not the exclusion of any other element or integer or group of elements or integers. 

Vascular endothelial growth factor (hereinafter referred to as "VEGF"), also known as 
vasoactive permeability factor, is a secreted, covalently linked homodimeric glycoprotein 
that specifically activates endothelial tissues (Senger et aL, 1993). A range of functions 
have been attributed to VEGF such as its involvement in normal angiogensis including 
formation of the corpus luteum (Yan et aL, 1993) and placental development (Sharkey 
et aL, 1993), regulation of vascular permeability (Senger et aL, 1993), inflammatory 
angiogenesis (Sunderkotter et aL, 1994) and autotransplantation (Dissen et aL, 1994) and 
human diseases such as tumour promoting angiogenesis (Folkman & Shing, 1992), 
rheumatoid arthritis (Koch et al. t 1994) and diabetes related retinopathy (Folkman & 
Shing, 1992). 
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VEGF is, therefore, an important molecule making it a potentially valuable target for 
research into therapeutics, prophylactics and diagnostic agents based on VEGF or its 
activities. There is also a need to identify homologues or otherwise related molecules 
5 for use as an alternative to VEGF or in conjunction with VEGF. 



In work leading up to the present invention, the inventors sought the multiple endocrine 
neoplasia type I susceptibility gene (MEN1). Surprisingly, the inventors discovered that 
a genetic sequence excluded as a candidate for the MEN1 gene was nevertheless a new 
10 growth factor having some similarity to VEGF. 

Accordingly, one aspect of the present invention comprises a biologically isolated 
proteinaceous molecule comprising a sequence of amino acids which: 

(i) is at least about 15% similar to the amino acid sequence set forth in SEQ ID 
15 NO:2; and 

(ii) is at least 5% dissimilar to the amino acid sequence set forth in SEQ ID NO:2. 

Another aspect of the present invention provides a biologically isolated proteinaceous 
molecule having the following characteristics: 
20 (i) comprises an amino acid sequence having at least about 15% similarity but at 

least about 5% dissimilarity to all or part of the amino acid sequence set forth 

in SEQ ID NO:2; 
(ii) exhibits at least one property in common with VEGF. 

25 A related aspect of the present invention contemplates a biologically isolated 
proteinaceous molecule having the following characteristics: 

(i) comprises an amino acid sequence having at least about 15% similarity but at 
least about 5% dissimilarity to the amino acid sequence set forth in SEQ ID 

NO:2; 

30 (ii) exhibits at least one of the following properties: 

(a) ability to induce proliferation of vascular endothelial cells; 

(b) ability to interact with flt-\fjlk-\ family of receptors; 
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(c) ability to induce cell migration, cell survival and/or an increase in 
intracellular levels of alkaline phosphatase. 

By "biologically isolated" is meant that the molecule has undergone at least one step of 
5 purification from a biological source. Preferably, the molecule is also biologically pure 
meaning that a composition comprises at least about 20%, more preferably at least about 
40%, still more preferably at least about 65%, even still more preferably at least about 
80-90% or greater of the molecule as determined by weight, activity or other convenient 
means, relative to other compounds in the composition. Most preferably, the molecule 
10 is sequencably pure. 

Another preferred aspect of the present invention provides the molecule in recombinant 
form. 

15 According to this aspect of the present invention, there is provided a recombinant 
molecule comprising a sequence of amino acids which: 

(i) is at least about 15% similar to the amino acid sequence set forth in SEQ ID 
NO:2; and 

(ii) is at least 5% dissimilar to the amino acid sequence set forth in SEQ ID NO:2. 

20 

A related aspect of the present invention is directed to a recombinant molecule having 
the following characteristics: 

(i) comprises an amino acid sequence having at least about 15% similarity but at 
least about 5% dissimilarity to all or part of the amino acid sequence set forth 

25 in SEQ ID NO:2; 

(ii) exhibits at least one property in common with VEGF. 

A further related aspect of the present invention contemplates a recombinant molecule 
having the following characteristics: 
30 (i) comprises an amino acid sequence having at least about 15% similarity but at 

least about 5% dissimilarity to the amino acid sequence set forth in SEQ ID 

NO:2; 
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(ii) exhibits at least one of the following properties: 

(a) ability to induce proliferation of vascular endothelial cells; 

(b) ability to interact with JltAlflkA family of receptors; 

(c) ability to induce cell migration, cell survival and/or an increase in 



The present invention also contemplates genomic or partial genome clones encoding a 
proteinaceous molecule having at least about 15% amino acid similarity but at least 
about 5% dissimilarity to SEQ ID NO:l. 



The amino acid sequence set forth in SEQ ID NO:2 corresponds to human VEGF 
(referred to herein as "VEGF 165 "). Accordingly, the molecule of the present invention 
is VEGF-like or is a homologue of VEGF but comprises an amino acid sequence which 
is similar but non-identical to the amino sequence of VEGF, Although the present 

15 invention is exemplified using a human VEGF-like molecule, this is done with the 
understanding that the instant invention contemplates the homologous molecule and 
encoding sequence from other mammals such as livestock animals (e.g. sheep, pigs, 
horses and cows), companion animals (e.g. dogs and cats) and laboratory test animals 
(e.g. mice, rats, rabbits and guinea pigs) as well as non-mammals such as birds (e.g. 

20 poultry birds), fish and reptiles. In a most preferred embodiment, the VEGF-like 
molecule is of human origin and encoded by a gene located at chromosome 1 lql3. The 
present invention extends, therefore, to the genomic sequence or part thereof encoding 
the subject VEGF-like molecule. 

25 Preferably, the percentage similarity is at least about 30%, more preferably at least about 
40%, still more preferably at least about 50%, still even more preferably at least about 
60-70%, yet even more preferably at least about 80-95% to all or part of the amino acid 
sequence set forth in SEQ ID NO:2. 

30 In a particularly preferred embodiment, the VEGF-like molecule of the present invention 
comprises a sequence of amino acids as set forth in SEQ ID NO:4 or is a part, fragment, 
derivative or analogue thereof. The amino acid sequence set forth in SEQ ID NO:4 is 



5 



intracellular levels of alkaline phosphatase. 



10 
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also referred to herein as "SOM175 short ". Particularly preferred similarities include 
about 19-20%, and 29-30%. Reference herein to derivatives also includes splice 
variants. Accordingly, the present invention extends to splice variants of SOM175 shorr 
Examples of splice variants contemplated by the present invention include but are not 
5 limited to variants with an amino acid sequence substantially as set forth in at least one 
of SEQ ID NO:6, SEQ ID NO:8 and/or SEQ ID NO: 10 or mutants or derivatives or 
further splice variants thereof. 

Another embodiment provides a recombinant molecule having the following 
10 characteristics: 

(i) an amino acid sequence substantially as set forth in SEQ ID NO:4 or having at 
least about 1 5% similarity to all or part thereof provided that said amino acid 
sequence is at least about 5% dissimilar to all or part of the amino acid sequence 
set forth in SEQ ID NO:2; 
15 (ii) exhibits at least one biological property in common with VEGF. 

Another embodiment provides a recombinant molecule having the following 

characteristics: ... 

(i) an amino acid sequence substantially as set forth in SEQ ID NO:6 or having at 
20 least about 15% similarity to all or part thereof provided that said amino acid 

sequence is at least about 5% dissimilar to all or part of the amino acid sequence 
set forth in SEQ ID NO:2; 

(ii) exhibits at least one biological property in common with VEGF. 

25 Another embodiment provides a recombinant molecule having the following 
characteristics: 

(i) an amino acid sequence substantially as set forth in SEQ ID NO: 8 or having at 
least about 1 5% similarity to all or part thereof provided that said amino acid 
sequence is at least about 5% dissimilar to all or part of the amino acid sequence 

30 set forth in SEQ ID NO:2; 

(ii) exhibits at least one biological property in common with VEGF. 
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Another embodiment provides a recombinant molecule having the following 
characteristics: 

(i) an amino acid sequence substantially as set forth in SEQ ID NO: 10 or having at 
least about 15% similarity to all or part thereof provided that said amino acid 

5 sequence is at least about 5% dissimilar to all or part of the amino acid sequence 

set forth in SEQ ID NO:2; 

(ii) exhibits at least one biological property in common with VEGF. 

Such properties of VEGF include at least one of: 
10 (a) ability to induce proliferation of vascular endothelial cells; 

(b) an ability to interact with flt-\lflk-\ family of receptors; 

(c) an ability to induce cell migration, cell survival and/or an increase in intracellular 
levels of alkaline phosphatase. 

15 In accordance with the present invention, a preferred similarity is at least about 40%, 
more preferably at least about 50% and even more preferably at least about 65% 
similarity. 

Still a further aspect of the present invention contemplates a peptide fragment 
20 corresponding to a portion of the amino acid sequence set forth in SEQ ID NO:4 or a 
splice variant thereof such as set forth in SEQ ID NO:6, SEQ ID NO:8 or SEQ ID 
NO: 10 or a chemical equivalent thereof. The biologically isolated or recombinant 
molecule of the present invention may be naturally glycosylated or may comprise an 
altered glycosylation pattern depending on the cells from which it is isolated or 
25 synthesised. For example, if produced by recombinant means in prokaryotic organisms, 
the molecule would be non-glycosylated. The molecule may be a full length, naturally 
occurring form or may be a truncated or otherwise derivatised form. 

Yet another aspect of the present invention is directed to a nucleic acid molecule 
30 encoding the VEGF-like molecule herein described. More particularly, the present 
invention provides a nucleic acid molecule comprising a sequence of nucleotides 
substantially as set forth in SEQ ID NO:3 or having at least 15% similarity to all or part 
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thereof or being capable of hybridising under low stringency conditions to a reverse 
complement of the nucleotide sequence as set forth in SEQ ID NO:3 provided that the 
nucleic acid sequence having at least 15% similarity but at least 30% dissimilarity to the 
nucleotide sequence as set forth in SEQ ID NO:3. The nucleotide sequence set forth in 
5 SEQ ID NO:3 is also referred to herein as "SOM175". Preferably, the percentage 
dissimilarity is about 35%, more preferably about 39% and even more preferably about 
40-50% or greater. 

For the purposes of defining the level of stringency, reference can conveniently be made 
10 to Sambrook et al (1989) at pages 9.47-9.51 which is herein incorporated by reference 

where the washing steps disclosed are considered high stringency. A low stringency is 

defined herein as being in 4-6X SSC/0. 1-0.5% w/v SDS at 37-45°C for 2-3 hours. 

Depending on the source and concentration of nucleic acid involved in the hybridisation, 

alternative conditions of stringency may be employed such as medium stringent 
15 conditions which are considered herein to be 1-4X SSC/0.25-0.5% w/v SDS at > 45°C 

for 2-3 hours or high stringent conditions considered herein to be 0.1 -IX SSC/0. 1% w/v 

SDS at 60°C for 1-3 hours. 

The present invention further contemplates a nucleic acid molecule which encodes a 
20 VEGF-like molecule as hereinbefore described having at least 15% nucleotide sequence 
homology to SEQ ID NO:3. Preferred levels of homology include at least about 40%, 
more preferably around 60-70%. 

The present invention is further directed to the murine homologue of human VEGF 
25 (referred to herein as "mVRF"). The mVRF has approximately 85% identity and 92% 
conservation of amino acid residues over the entire coding region compared to human 
VEGF. The mVRF is encoded by a nucleic acid molecule comprising a nucleotide 
sequence substantially as set forth in Figure 9. 

30 The VEGF-like molecule of the present invention will be useful in the development of 
a range of therapeutic and/or diagnostic applications alone or in combination with other 
molecules such as VEGF. The present invention extends, therefore, to pharmaceutical 
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compositions comprising the VEGF-like molecule or parts, fragments, derivatives, 
homologies or analogues thereof together with one or more pharmaceuticaliy acceptable 
carriers and/or diluents. Furthermore, the present invention extends to vectors 
comprising the nucleic acid sequence set forth in SEQ ID NO:3 or having at least about 
5 1 5%, more preferably about 40% and even more preferably around 60-70% similarity 
thereto but at least 30% and more preferably around 39% dissimilarity thereto and host 
cells comprising same. In addition, the present invention extends to ribozymes and 
antisense molecules based on SEQ ID NO:3 as well as neutralizing antibodies to the 
VEGF-like molecule. Such molecules may be useful in ameliorating the effects of, for 
1 0 example, over expression of VEGF-like genes leading to angiogenesis or vascularization 
of tumours. 

The present invention also contemplates antibodies to the VEGF-like molecule or nucleic 
acid probes to a gene encoding the VEGF-like molecule which are useful as diagnostic 
1 5 agents. 

The present invention is further described by reference to the following non-limiting 
Figures and/or Examples. 

20 In the Figures: 

Figure 1 Nucleotide sequence [SEQ ID NO:l] and corresponding amino acid 



sequence [SEQ ID NO:2] of VEGF 



165* 



25 Figure 2 Nucleotide sequence [SEQ ID NO:3] and corresponding amino acid 
sequence [SEQ ID NO:4] of SOM175, 



Figure 3 



Results of BLAST search with SOM175 protein sequence. 



30 Figure 4 



BESTFIT alignment of VEGF cDNA and SOM175 cDNA. 
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Figure 5 Multiple alignment of VEGF 165 with SOM175 and its splice variants at 
the nucleotide level. 

5 Figure 6 Multiple alignment of VEGF 165 with SOM175 and its splice variants at 
the amino acid level. 

Figure 7 Diagrammatic representation of SOM175 and its splice variants. 
Figure 8(a) Diagrammatic representation of genomic structure of human SOM175 
10 genomic showing exon/intron map. 

Figure 8(b) Diagrammatic representation of genomic structure of human SOM175 
showing exon/intron boundries. 

1 5 Figure 9 Nucleotide and predicted peptide sequences derived from mVRF cDNA 
clones. Numbering of nucleotides are given on the left, starting from the A of the 
initiation codon. Amino acids are numbered on the right, starting from the first residue 
of the predicted mature protein after the putative signal peptide has been removed. The 
alternately spliced region is double underlined and the resulting peptide sequence from 

20 each mRNA is included. A potential polyadenylation signal is indicated in boldface. 
Start and stop codons of mVRF 167 and mVRF 186 are underlined and a polymorphic AC 
repeat in the 3' UTR is indicated by a stippled box. The positions of intron/exons 
boundaries are indicated by arrowheads. 

25 Figure 10 BESTFIT alignments of human and murine VRF protein isoforms. A: 
mVRF 167 and hVRF 167 . B: mVRF 186 and hVRF 186 from the point where the sequences 
diverge from the respective 167 amino acid isoforms. Amino acid identities are marked 
with vertical bars and conserved amino acids with colons. An arrow marks the predicted 
signal peptide cleavage site of human and mouse VRF. 

30 
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Figure 11 BESTFIT alignment of mVRF 167 and mVEGF 188 (Breier et al y 1992) 
peptide sequences. An arrow marks the signal peptide cleavage site of mVEGF. 
Identical amino acids are indicated by vertical bars and conservative substitutions by 
5 colons. Numbering of amino acids is as described in the legend to Figure 9. 

Figure 12 Comparison of gene structure between VRF (a generic VRF gene is 
shown since the intron/exon organisation of the mouse and human homologues is almost 
identical) and other members of the human VEGF/PIGF/PDGF gene family. Exons are 
10 represented by boxes. Protein coding regions and untranslated regions are shown by 
filled and open sections respectively. The hatched region in VRF indicates the 
additional 3' UTR sequence formed by alternate splicing of the VRFj 86 isoform. 
Potential alternate splice products of each gene are shown. 

1 5 Figure 13 Autoradiogram of a Northern blot of total RNA from various adult mouse 
tissues (as indicated) hybridised with an mVRF cDNA clone. A major transcript of 1.3 
kb was detected in all samples. 

Figure 14 Film autoradiographs (A-C) and dark-field micrographs (D-E) illustrating 
20 the expression pattern of mVRF and mRNA in the mouse. In the El 4 mouse embryo 
(A) positive signals are present over the developing heart (Ha) and cerebral cortex (Cx). 
A low background signal is also present over other tissues in the section. In the El 7 
embryo (B) and the heart (Ha) is clearly visible due to a strong hybridisation signal. An 
equally strong signal is present over brown adipose tissue (Fa) in the back and around 
25 the thoracic cage. A moderate hybridisation signal is present over the spinal cord (SC) 
and the tongue (T). The background signal is reduced compared with the E14 embryo. 
In the young adult mouse (C-D), positive signals are present over the heart (Ha) and 
adipose tissue (Fa) around the thoracic cage, while, for example, the lungs (Lu) are 
unlabeled). The hybridisation signal over the heart is evenly distributed over the entire 
30 left ventricle, including papillary muscles (D). In the El 7 heart hybridised with an 
excess of cold probe, no positive signal is present (E). Scale bars = 0.5 mm (A), 1.2 
mm (B), 1 mm (C), 0.3 mm (D), 0.1 mm (E). 
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Figure 15 Dark - (A and C) and bright-field (B and D) micrographs showing mVRF 
mRNA expression in mouse adipose tissue (A-B) and spinal cord (C-D). A strong 
hybridisation signal is present over fat (A), as shown by the strong labeling in Sudan 
black stained sections (B). A weak signal is present also in skeletal muscle (M in A-B). 
5 In the adult spinal cord (C) the mVRF probes gave a neuronal staining pattern over the 
gray matter, Toloudine counterstaining showing that motoneurons in the ventral horn 
(D), interneurons in the deep part of the dorsal horn and around the central canal (not 
shown) where largely positive for mVRF mRNA. Scale bars = 0.1 mm (A), 0.1 mm 
(B), 0.25 mm (C), 0.015 mm (D). 

10 

TABLE 1 

SUMMARY OF SEQUENCE IDENTITY NUMBERS 



15 


SEQ 


ID 


NO:l 


Nucleotide sequence of VEGF 165 




SEQ 


ID 


NO:2 


Amino acid sequence of VEGF 165 




SEQ 


ID 


NO:3 


Nucleotide sequence of SOM175 (VEGF-like molecules) 




SEQ 


ID 


NO:4 


Amino acid sequence of SOM175 




SEQ 


ID 


NO:5 


Nucleotide sequence of SOM175 absent exon 6 


20 


SEQ 


ID 


NO:6 


Amino acid sequence of SOM175 absent exon 6 




SEQ 


ID 


NO:7 


Nucleotide sequence of SOM175 absent exon 6 and exon 7 




SEQ 


ID 


NO:8 


Amino acid sequence of SOM175 absent exon 6 and exon 7 




SEQ 


ID 


NO:9 


Nucleotide sequence of SOM175 absent exon 4 




SEQ 


ID 


NO: 10 


Amino acid sequence of SOM175 absent exon 4 


25 


SEQ 


ID 


NO: 11 


Oligonucleotide 




SEQ 


ID 


NO: 12 


Oligonucleotide 




SEQ 


ID 


NO: 13 


Oligonucleotide 




SEQ 


ID 


NO: 14 


Oligonucleotide 
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EXAMPLE 1 

Human cDNA clones 

The original SOM175 cDNA was isolated by screening a human foetal brain library 
5 (XzapII, Stratagene) with the cosmid D11S750 (Larsson et al t 1992). The plasmid was 
excised "in vivo" and a single 1.1 kb cDNA was obtained. Three independent SOM175 
cDNAs clones were also isolated from a human foetal spleen library (Strategane, Uni- 
zap) using the above-mentioned SOM175 insert as a probe. Three clones were obtained: 
SOM175-4A, -5 A and -6A. SOM175-5A is an alternately spliced clone with exon 4 
10 being absent (SOM175-e4). These library screens were performed using hybridisation 
conditions recommended by the manufacturer of the library (Stratagene) and random 
primed insert of SOM175. 

Two partial human SOM175 cDNAs have also isolated from a XGTl 1 human melanoma 
15 cell line A2058 library (Clontech) cDNA library screens were performed using 
hybridisation conditions described by Church and Gilbert, 1984). In each case, the 
probe was generated by random priming of a PCR product derived from SOM175 (18f- 
700r). 

20 Mouse cDNA Clones 

Human SOM175 was also used to screen a mouse neonatal whole brain cDNA library 
(Unizap, Stratagene). Four non-chimeric clones were isolated: M175-A, B, C, D. All 
clones were partial cDNAs and M175-C contained several introns. Three of these 
cDNAs lacked the exon 6. 

25 

Another clone referred to as Ml was completely sequenced and was found to contain 
the full open reading frame plus part of the 5'utr and total 3'utr. 
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EXAMPLE 2 



DNA SEQUENCE ANALYSIS 



The entire sequence of the cDNA clone (SOM175) was compiled and is shown in 
5 Figure 2 with its corresponding amino acid sequence. This sequence was screened for 
open reading frames using the MAP program (GCG, University of Wisconsin). A single 
open reading frame of 672bp was observed (see Figure 2). There appears to be little 5' 
untranslated sequences (2bp). The 3' untranslated region appears to be complete as it 
includes a poly-adenylation signal and poly-A tail. 



Database homology searches were performed using the BLAST algorithm (run at NCBI, 
USA). This analysis revealed homology to several mammalian forms of VEGF (see 
Figure 3). The amount of homology between SOM175 and human VEGF 165 was 
determined using the BESTFIT program (GCG, University of Wisconsin; see Figures 
15 4 and 5). Nucleotide homology was estimated at 69.7% and protein homology was 
estimated as at least 33.3% identity and 52.5% conservation using BESTFIT analysis. 
BLAST analysis on nucleotide sequences revealed the almost complete match to a 
human expressed sequence tag EST06302 (Adams et al> 1993). 

20 These data indicate that SOM175 encodes a growth factor that has structural similarities 
to VEGF. Both genes show start and stop codons in similar positions and share discrete 
blocks of homology. All 8 cysteines as well as a number of other VEGF residues 
believed to be involved in dimerisation are conserved. These residues are Cysteine-47, 
Proline-70, Cysteine-72, Valine-74, Arginine-77, Cysteine-78, Glycine-80, Cysteine-81, 

25 Cysteine-82, Cysteine-89, Proline-91, Cysteine-122 and Cysteine-124 and are shown in 
Figure 6. Given the structural conservation between VEGF and the SOM175 gene 
product it is also possible that they share functional similarities. It is proposed that 
SOM175 encodes a VEGF-like molecule that shares some properties with VEGF but has 
unique properties of its own. The nucleotide sequence and corresponding amino acid 

30 sequence of VEGF 165 is shown in Figure 1. 



10 
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EXAMPLE 3 

The percentage similarity and divergence between VEGF 165 family and SOM175 family 
(protein) were analysed using the Clustal method, MegAlign Software, DNASTAR, 
5 Wisconsin. The results are shown in Tables 2.1 and 2.2. The alternatively spliced 
forms of SOM175 are abbreviated to SOM715-e6 where all of exon 6 is deleted; 
SOM715-e6 and 7 where all of exons 6 and 7 are deleted; and SOM175-e4 where all 
of exon 4 is deleted. The spliced form of SOM175 are shown in Figure 7. Genomic 
maps of SOM175 showing intron/exon boundaries are shown in Figure 8a and 8b. 

10 

Table 2.1 

A Percent nucleotide similarity between splice variants of SOM175 and 
human VEGF 165 

15 





VEGF 165 SOM175 


SOM175-e6 


SOM175-e6&7 


SOM175-e4 


VEGF,« 


*** 34.9 


39.7 


41.4 


37.0 


SOM175 


*** 


98.9 


95.1 


99.2 


SOM175-e6 




*** 


98.8 


84.0 


SOM175-e6&7 






*** 


80.3 


SOM175-e4 








*** 
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B Percent nucleotide divergence between splice variants of SOM175 and 
human VEGF,^ 





VEGF 165 


SOM175 


SOM175-e6 


SOM175-e6&7 


SOM175-e4 


VEGF 165 


*** 


41.7 


41.6 


41.7 


41.8 


SOM175 




*** 


0.2 


0.2 


0.0 


SOM175-e6 






*** 


0.0 


0.2 


SOM175-e6&7 








*** 


0.3 


SOM175-e4 










*** 



Table 2.2 

15 A Percent amino acid identity between splice variants of SOM175 and 
human VEGF,- 





VEGF 165 


SOM175 


SOM175-e6 


SOM175-e6&7 SOM175-e4 


VEGF 165 


*** 


31.4 


42.3 


33.5 


40.6 


SOM175 




*** 


74.7 


73.7 


99.1 


SOM175-e6 






*** 


76.8 


99.1 


SOM175-e6&7 








*** 


99.1 


SOM175-e4 










*** 
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B Percent amino acid divergence between splice variants of SOM175 and 
human VEGF^c 





VEGF 165 


SOM175 


SOM175-e6 


SOM175-e6&7 


SOM175-e4 


VEGF 165 


*** 


65.7 


55.4 


54.6 


57.4 


SOM175 




*** 


19.9 


4.2 


0.0 


SOM175-e6 






*** 


0.0 


0.0 


SOM175-e6&7 








*** 


0.0 


SOM175-e4 










*** 



15 EXAMPLE 4 

BIOASSAYS TO DETERMINE THE FUNCTION OF SOM175 

Assays are conducted to evaluate whether SOM175 has similar activities to VEGF on 
endothelial cell function, angiogenesis and wound healing. Other assays are 
performed based on the results of receptor binding distribution studies. 

20 

Assays of endothelial cell function 

Endothelial cell proliferation. Endothelial cell growth assays as described in Ferrara 
& Henzel (1989) and in Gospodarowicz et al (1989). 

25 Vascular permeability assay. This assay, which utilises the Miles test in guinea pigs, 
will be performed as described in Miles & Miles (1952). 

Cell adhesion assay. The influence of SOM175 on adhesion of polymorphs to 
endothelial cells is analysed. 

30 

Chemotaxis. This is performed using the standard Boyden chamber chemotaxis 
assay. 
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Plasminogen activator assay. Endothelial cells are tested for plasminogen activator 
and plasminogen activator inhibitor production upon addition of SOM175 (Pepper et 
al (1991)). 

Endothelial cell migration assay. The ability of SOM175 to stimulate endothelial 
cells to migrate and form tubes is assayed as described in Montesano et al (1986). 

Angiogenesis Assay 

SOM175 induction of an angiogenic response in chick chorioallantoic membrane is 
evaluated as described in Leung et al (1989). 

Possible neurotrophic actions of SOM175 are assessed using the following assays: 

Neurite outgrowth assay and gene induction (PC12 cells) 

15 PC 12 cells (a phaeochromocytoma cell line) respond to NGF and other neurotrophic 
factors by developing the characteristics of sympathetic neurons, including the 
induction of early and late genes and the extension of neurites. These cells are 
exposed to SOM175 and their response monitored (Drinkvvater et al (1991); and 
Drinkwater et al (1993)). 

20 

Cultured neurons from the Peripheral Nervous System (PNS) 

Primary cultures of the following PNS neurons are exposed to SOM175 and 

monitored for any response: 

- sensory neurons from neural crest and dorsal root ganglia 
25 - sympathetic neurons from sympathetic chain ganglia 

- placode derived sensory neurons from nodose ganglia 

- motoneurons from spinal cord 

The assays are described in Suter et al (1992) and in Marinou et al (1992). 

30 Where an in vitro response is observed, in vivo assays for properties such as uptake 
and retrograde transport are performed as described in Hendry et al (1992). 



5 



10 



95 1 222 t p:\oper\cjh,NGFILprv, 1 7 




- 18 - 

Nerve regeneration (PNS) 

Where neurotrophic effects of SOM175 are observed, its possible role in the 
regeneration of axotomised sensory neurons, sympathetic neurons and motoneurons is 
analysed by the methods of Otto et al (1989); Yip et al (1984) and Hendry et al 
5 (1976). 

Actions of SOM175 on CNS neurons 

The ability of SOM175 to promote survival of central nervous system neurons is 
analysed as described in Hagg et al (1992); Williams et al (1986); Hefti (1986) and 
10 Kxomer (1987). 

Wound Healing 

The ability of SOM175 to support wound healing are tested in the most clinically 
relevant model available, as described in Schilling et al (1959) and utilised by Hunt 
15 et al (1967). 

The Haemopoietic System 

A variety of in vitro and in vivo assays on specific cell populations of the 
haemopoietic system are available and are outlined below: 
20 Stem Cells 
Murine 

A variety of novel in vitro murine stem cell assays have been developed using 
FACS-purified cells: 
(a) Repopulating Stem Cells 
25 These are cells capable of repopulating the bone marrow of lethally irradiated mice, 
and have the Lin*, Rh hi , Ly-6A/E + , c-kit + phenotype. The test substance is tested on 
these cells either alone, or by co-incubation with multiple factors, followed by 
measurement of cellular proliferation by 3 H thymidine incorporation. 
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(b) Late Stage Stem Cells 

These are cells that have comparatively little bone marrow repopulating ability but 
can generate D13 CFU-S. These cells have the Lin", Rh h \ Ly-6A/E + , c-kit + 
5 phenotype. The test substance is incubated with these cells for a period of time, 
injected into lethally irradiated recipients, and the number of D13 spleen colonies 
enumerated. 

(c) Progenitor-Enriched Cells 

10 These are cells that respond in vitro to single growth factors, and have the Lin", Rh hl , 
Ly-6A/E + , c-kit + phenotype. This assay will show if SOM175 can act directly on 
haemopoietic progenitor cells. The test substance is incubated with these cells in 
agar cultures, and the number of colonies enumerated after 7-14 days. 

1 5 Atherosclerosis 

Smooth muscle cells play a crucial role in the development or initiation of 
atherosclerosis, requiring a change in their phenotype from a contractile to a synthetic 
state. Macrophages, endothelial cells, T lymphocytes and platelets all play a role in 
the development of atherosclerotic plaques by influencing the growth and phenotypic 
20 modulations of smooth muscle cell. An in vitro assay that measures the proliferative 
rate and phenotypic modulations of smooth muscle cells in a multicellular 
environment is used to assess the effect of SOM175 on smooth muscle cells. The 
system uses a modified Rose chamber in which different cell types are seeded onto 
opposite coverslips. 

25 

Effects of SOM175 on bone 

The ability of SOM175 to regulate proliferation of osteoblasts is assayed as described 
in Lowe et al (1991). Any effects on bone resorption are assayed as described in 
Lowe et al (1991). Effects on osteoblast migration and changes in intracellular 
30 molecules (e.g. cAMP accumulation, alkaline phosphatase levels) are analysed as 
described in Midy et al (1994). 
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Effects on skeletal muscle cells 

Effects of SOM175 on proliferation of myoblasts and development of myotubes can 
be determined as described by Ewton et al (1980) and by Gospodarowicz et al 
(1976). 

5 

EXAMPLE 5 
CLONING MURINE VEGF DNA 

Isolation of cDNAs 

Murine VRF (mVRF) clones were selected from a lambda Zap new born whole brain 
10 cDNA library (Stratagene). Primary phage from high density filters (5 x 10 4 

pfu/plate) were identified by hybridisation with a 682bp 32 P-labelled probe generated 
by PCR from an hVRF cDNA (pSOM175) as described above. Hybridisation and 
stringent washes of nylon membranes (Hybond-N) were carried out at 65°C under 
conditions described by Church and Gilbert (1984). Positive plaques were picked, 
15 purified and excised in vivo to produce bacterial colonies containing cDNA clones in 
pBluescript SK-. 



Isolation of genomic clones 

Genomic clones were isolated from a mouse strain SV/129 library cloned in the 
20 lambda Fix II vector (Stratagene). High density filters (5 x 10 4 pfu/filter) were 
screened with a 563 bp 32 P-labelled probe generated by PCR amplification of the 
nucleotide 233-798 region of the mVRF cDNA (see Figure 9). Positive clones were 
plugged and re-screened with filters containing 400-800 pfu. Large scale phage 
preparations were prepared using the QIAGEN lambda kit or by ZnCl 2 purification 
25 (Santos, 1991). 



Nucleotide sequencing and analysis 

cDNAs were sequenced on both strands using a variety of vector-based and internal 
primers with Applied Biosystems Incorporated (ABI) dye terminator sequencing kits 
30 according to the manufacturer's specifications. Sequences were analysed on an ABI 
Model 3 73 A automated DNA sequencer. Peptide homology alignments were 
performed using the program BESTFIT (GCG, Wisconsin). 
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Identification of intron/exon boundaries 

Identification of exon boundaries and flanking regions was carried out using PCR 
with mouse genomic DNA or mVRF genomic lambda clones as templates. The 
primers used in PCR to identify introns were derived from the hVRF sequence and to 
5 allow for potential human-mouse sequence mismatches annealing temperatures 5- 
10°C below the estimated T m were used. All PCR products were sized by agarose 
gel electrophoresis and gel purified using QIAquick spin columns (Qiagen) and the 
intron/exon boundaries were sequenced directly from these products. In addition, 
some splice junctions were sequenced from subcloned genomic fragments of MVRF. 
10 Intron/exon boundaries were identified by comparing cDNA and genomic DNA 
sequences. 

Northern analysis 

Total cellular RNA was prepared from a panel of fresh normal adult mouse tisues 
15 (brain, kidney, liver, muscle) using the method of Chomczynski and Sacchi (1987). 
20jj.g of total RNA were electrophoresed, transferred to a nylon membrane (Hybond 
N, Amersham) and hybridised under standard conditions (Church & Gilbert, 1984). 
Filters were washed at 65°C in O.lxSSC (20xSSC is 3M NaCl/0.3M trisodium 
citrate), 0.1% SDS and exposed to X-ray film with intensifying screens at -70°C for 
20 1-3 days. 

Characterisation of mVRF cDNAs 

Murine VRF homologues were isolated by screening a murine cDNA library with an 
hVRF cDNA clone. Five clones of sizes varying from 0.8-1.5 kb were recovered 
25 and sequenced. The cDNA sequences were complied to give a full length 1041 bp 
cDNA sequence covering the entire open reading frame (621 bp or 564 bp depending 
on the splice form, see below) and 3' UTR (379 bp), as well as 163 bp of the 5' 
UTR (Figure 9). 

30 The predicted initiation codon matched the position of the start codon in hVRF. One 
other out of frame ATG was located at position -47 and two termination codons were 
observed upstream (positions -9 and -33, respectively) and in-frame with the putative 
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initiation codon. 

The predicted N-terminal signal peptide of hVRF appears to be present in mVRF 
with 81% identity (17/21 amino acids). Peptide cleavage within mVRF is expected 
5 to occur after reside 21 (Figure 10). These data suggest that mature mVRF is 
secreted and could therefore conceivably function as a growth factor. 

As with hVRF, two open reading frames (ORFs) were detected in cDNAs isolated by 
library screening. Four of five clones were found to be alternatively spliced and 
10 lacked a 101 bp fragment homologous to exon 6 of hVRF. The predicted peptide 
sequences of the two isoforms of mVRF were determined and aligned with the 
corresponding human isoforms (Figure 10). 

The message encoding mVRF lg6 contains a 621 bp ORF with coding sequences 
15 terminating at position +622, towards the end of exon 7 (Figure 9). The smaller 
message encoding mVRF 167 actually terminates downstream of the +622 TAG site 
due to a frame shift resulting from splicing out of the 101 bp exon 6 and the 
introduction of a stop codon (TGA) at position +666, near the beginning of exon 8 
(Figure 9). 

20 

The mVRF 186 protein has strong homology to the amino and central portions of 
VEGF while the carboxyl end is completely divergent an is alanine rich. mVRF 167 
possesses these similarities and also maintains homology to mVEGF right through to 
the C-terminus (Figure 11). The overall homology of mVRF 167 to hVRF 167 was 
25 85% identity and 92% similarity, respectively (Figure 10). Likewise, homology 
between mVRF 167 and mVEGF (Breier et al 9 1992) was 49% identity and 71% 
conservative amino acid substitution, respectively (Figure 11). 

A canonical vertebrate polyadenylation signal (AATAAA) (Birnstiel et al, 1986) was 
30 not present in the mVRF cDNA, however, the closely matching sequence GAT AAA 
is present at similar positions in both mouse and human VRF cDNAs (Figure 9). In 
contrast to hVRF, mVRF was found to contain an AC dinucleotide repeat at the 
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extreme 3' end of the 3' UTR (nucleotide positions 998 to 1011, Figure 9). 
Polymorphism of this repeat region was observed between some of the mVRF 
cDNAs, with the number of dinucleotides varying from 7 to 11. 

5 Genomic characterisation of mVRF 

Intron/exon boundaries (Table 3) were mapped using primers which flanked 
sequences homologous to the corresponding hVRF boundaries. Introns I, III, IV and 
VI of mVRF (Table 3, Figure 12) were smaller than the hVRF intervening 
sequences. The complete genomic sequence was compiled from the 5' UTR of 

10 mVRF through to intron VI, the largest intervening region (2.2 kb), by sequencing 
amplified introns and cloned genomic portions of mVRF. There was only one major 
difference in genomic structure between mVRF and hVRF and that was the exon 
7/intron VI boundary of mVRF was located lObp further downstream in relation to 
the cDNA sequence, hence exon 7 in mVRF is lObp longer than the corresponding 

15 exoninhVRF. 

Exons 6 and 7 are contiguous in mVRF, as has been found to occur in the human 
homologue. The strong sequence homology between exon 6 of mVRF and hVRF 
(Figure 1 0) suggests that this sequence is not a retained intronic sequence but rather 
20 encodes a functional part of the VRF 186 isoform. 

General intron/exon structure is conserved between the various members of the 
VEGF gene family (VEGF, PIGF, hVRF) and therefore it is not surprising that the 
overall genomic organisation of the mVRF gene is very similar to these genes 
25 (Figure 12). 

Previous comparative mapping studies have shown that the region surrounding the 
human multiple endocrine neoplasia type 1 disease locus on chromosome llql3 is 
syntenic with the proximal segment of mouse chromosome 19 (Rochelle et al 9 1992). 
30 Since the inventors have mapped the hVRF gene to within lkb of the human MEN1 
locus (see above) it is most likely that the murine VRF gene maps near the 
centromere of chromosome 19. 
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Expression studies of mVRF 

Northern analysis of RNA from adult mouse tissues (muscle, heart, lung and liver) 
showed that expression appears to be ubiquitous and occurs primarily as a major 
band of approximately 1.3kb in size (Figure 14). This is somewhat different to the 
5 pattern observed for hVRF in which two major bands of 2.0 and 5.5 kb have been 
identified in all tissues examined. The 1 .3 kb murine message presumably 
corresponds to the shorter of the human transcripts and the size variation thereof is 
most likely due to a difference in the length of the respective 5' UTRs. 

EXAMPLE 6 

EXPRESSION OF MURINE VEGF IN PRE- AND POST-NATAL MOUSE 
Animals 

Timed pregnant (n=4) and young adult (n=2) mice (C57 inbred strain, ALAB, 
Sweden) were sacrificed with carbon dioxide, and the relevant tissues were taken out 
and frozen on a chuck. Tissues were kept at -70°C until further use. Two 
gestational ages was used in this study; embryonic day 8 (E8), 14 and El 7. 

In situ hybridisation histochemistry 

In situ hybridisation was performed as previously described (Dagerlind et al 9 1992). 
20 Briefly, transverse sections (14^im) were cut in a cryostat (Microm, Germany), 
thawed onto Probe-On slides (Fisher Scientific, USA) and stored in black sealed 
boxes at -70°C until used. The sequences of the synthetic 42-mer oligonucleotides 
complementary to mRNA encoding mVRF were 

ACCACCACCTCCCTGGGCTGGCATGTGGCACGTGCATAAACG [SEQ ID 

25 NO: 11] (complementary to nt 120-161) and 

AGTTGTTTGACCACATTGCCCATGAGTTCCATGCTCAGAGGC [SEQ ID 
NO: 12] (complementary to nt 162-203). To detect the two alternative splice forms 
oligonucleotide GATCCTGGGGCTGGAGTGGGATGGATGATGTCAGCTGG [SEQ 
ID NO: 13] (complementary to nt xxx-xxx) and 

30 GCGGGCAGAGGATCCTGGGGCTGTCTGGCCTCACAGCACT [SEQ ID NO: 14] 
were used. The probes were labeled at the 3'-end with deoxyadenosine- 
alpha[thio]triphosphate [ 35 S] (NEN, USA) using terminal deoxynucleotidyl 



10 



15 
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transferase (IBI, USA) to a specific activity of 7-10 x 10 8 cpm/fig and hybridised to 
the sections without pretreatment for 16-18 h at 42°C. The hybridisation mixture 
contained: 50% v/v formamide, 4 x SSC (1 x SSC = 0.15M NaCl and 0.015M 
sodium-citrate), 1 x Denhardt's solution (0.02% each of polyvinylpyrrolidone, BSA 
5 and Ficoll), 1% v/v sarcosyl (N-lauroylsarcosine; Sigma), 0.02M phosphate buffer 
(pH 7.0), 10% w/v dextran sulfate (Pharmacia, Sweden), 250jag/ml yeast tRNA 
(Sigma), 500|ig/ml sheared and heat denatured salmon sperm DNA (Sigma) and 
200mM dithiothreitol (DTT; LKB, Sweden). In control sections, the specificity of 
both probes was checked by adding a 20-fold excess of unlabeled probe to the 

10 hybridisation mixture. In addition, adjacent sections were hybridised with a probe 
unrelated to this study which gave a different expression pattern. Following 
hybridisation the sections were washed several times in 1 x SSC at 55°C, dehydrated 
in ethanol and dipped in NTB2 nuclear track emulsion (Kodak, USA). After 3-5 
weeks the sections were development in D-19 developer (Kodak, USA) and cover- 

15 slipped. In some cases, sections were opposed to an autoradiographic film (Beta-max 
autoradiography film Amersham Ltd, UK) prior to emulsion-dipping. 

The four different probes gave identical hybridisation patterns in all tissues examined. 
Mouse VRF expression was detecting already in the E8 embryo, in which positive 

20 signal was recorded over structures most likely corresponding to the neuronal tube. 
In sagittal sections of E14 mouse embryo the strongest hybridisation signal was 
present over heart and in the nervous system, especially cerebral cortex (Figure 14A). 
A low level of expression was present in all other tissues. At a later gestational age, 
El 7, a high mVRF mRNA signal was confined to he heart and brown fat tissue in 

25 the back and around the neck (Figure 14B). Clearly positive hybridisation signals 
were present in the gray of the spinal cord and in the tongue (Figure 14B). 
Expression in the cerebral cortex was clearly reduced compared to day 14. The weak 
background expression seen in the El 4 embryo in for example muscle, had decreased 
at this gestational age. A strong mVRF mRNA hybridisation signal was present 

30 solely over the heart and in the brown fat in the young adult mice (Figure 14C). The 
signal over the heart was evenly distributed ove the entire ventricular wall, including 
the papillary muscles (Figure 14D). In sections of heart tissue hybridised with an 
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excess of cold probe, no specific labeling over background signal was recorded 
(Figure 14E). 

Apart from the heart, mVRF mRNA signal was present over certain tissues on the 
5 outside of the thoracic cage that morphologically resembled brown fat. This was 
verified with sudan black counterstaining, which showed a strong staining in the 
same areas (Figure 15A and 15B). In transverse sections of adult mouse spinal cord, 
the mVRF probes gave a neuronal staining pattern over the gray matter (Figure 15C). 
Counterstaining with toluidine (Figure 15D) showed that motoneurons in the ventral 
10 horn (Figure 15C and 15D), interneurons (Figure 15C) in the deep part of the dorsal 
horn and around the central canal where to a large extent positive for mVRF mRNA. 

Those skilled in the art will appreciate that the invention described herein is 
susceptible to variations and modifications other than those specifically described. It 
1 5 is to be understood that the invention includes all such variations and modifications. 
The invention also includes all of the steps, features, compositions and compounds 
referred to or indicated in this specification, individually or collectively, and any and 
all combinations of any two or more of said steps or features. 



20 TABLE 3 

Splice junctions of the murine VRF gene 



5' UTR* 


Exon 


1 


>223bp 


CCCAGgtacgtgcgt 


Intron 


I 


4 9 5bp 


ttccccacagGCCCC 


Exon 


2 


4 3bp 


GAAAGgtaataatag 


Intron 


II 


2 88bp 


ctgcccacagTGGTG 


Exon 


3 


197bp 


TGCAGgtaccagggc 


Intron 


III 


19 6bp 


ctgagcacagATCCT 


Exon 


4 


7 4bp 


TGCTAGgtgccagccc 


Intron 


IV 


182bp 


ctcttttcagACCTA 


Exon 


5 


3 6bp 


GACAGattcttggtg 


Intron 


V 


191bp 


ct cc t cc tagGGTTG 


Exon 


6 


lOlbp 




(no 


intron) 




CCCACTCCAGCCCCA 


Exon 


7 


13 5bp 


TGTAGgtaaggagtc 


Intron 


VI 


2200bp 


cactccccagGTGCC 


Exon 


8 


3 94bp 


AGAGATGGAGACACT 









Uppercase and lowercase letters denote exonic and intronic sequences respectively. 
* Indicates that the 5' end of exon 1 has not yet been determined. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: THE COUNCIL OF THE QUEENSLAND INSTITUTE 

OF MEDICAL RESEARCH and AMRAD 
CORPORATION LIMITED 

(ii) TITLE OF INVENTION: A NOVEL GROWTH FACTOR AND A 

GENETIC SEQUENCE ENCODING SAME - II 
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(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: DAVIES COLLISON CAVE 

(B) STREET: 1 LITTLE COLLINS STREET 

(C) CITY: MELBOURNE 

(D) STATE: VICTORIA 

(E) COUNTRY: AUSTRALIA 

(F) ZIP: 3000 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: AU PROVISIONAL 

(B) FILING DATE: 22-DEC-1995 

(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: HUGHES DR, E JOHN L 

(C) REFERENCE/DOCKET NUMBER: EJH/EK 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: +61 3 9254 2777 

(B) TELEFAX: +61 3 9254 2770 
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(2) INFORMATION FOR SEO ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 17... 589 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



TCGGGCCTCC GAAACC ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC 4 9 

Met Asn Phe Leu Leu Ser Trp Val His Trp Ser 
1 5 10 

CTT GCC TTG CTG CTC TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA 97 
Leu Ala Leu Leu Leu Tyr Leu His His Ala Lys Trp Ser Gin Ala Ala 
15 20 25 

CCC ATG GCA GAA GGA GGA GGG CAG AAT CAT CAC GAA GTG GTG AAG TTC 14 5 

Pro Met Ala Glu Gly Gly Gly Gin Asn His His Glu Val Val Lys Phe 
30 35 40 

ATG GAT GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC GAG ACC CTG GTG 193 
Met Asp Val Tyr Gin Arg Ser Tyr Cys His Pro lie Glu Thr Leu Val 
45 50 55 

GAC ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC ATC TTC AAG CCA 241 
Asp lie Phe Gin Glu Tyr Pro Asp Glu lie Glu Tyr lie Phe Lys Pro 
60 ^ 65 70 75 

TCC TGT GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC AAT GAC GAG GGC 28 9 

Ser Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly 
80 85 90 

CTG GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG 337 
Leu Glu Cys Val Pro Thr Glu Glu Ser Asn lie Thr Met Gin lie Met 
95 100 105 

CGG ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA 3 85 

Arg lie Lys Pro His Gin Gly Gin His lie Gly Glu Met Ser Phe Leu 
110 115 120 

CAG CAC AAC AAA TGT GAA TGC AGA CCA AAG AAA GAT AGA GCA AGA CAA 433 
Gin His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gin 
125 130 135 
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GAA AAT CCC TGT GGG CCT TGC TCA GAG CGG AGA AAG CAT TTG TTT GTA 4 81 

Glu Asn Pro Cys Gly Pro Cys Ser Glu Arg Arg Lys His Leu Phe Val 
140 145 150 155 

CAA GAT CCG CAG ACG TGT AAA TGT TCC TGC AAA AAC ACA GAC TCG CGT 52 9 

Gin Asp Pro Gin Thr Cys Lys Cys Ser Cys Lys Asn Thr Asp Ser Arg 
160 165 170 

TGC AAG GCG AGG CAG CTT GAG TTA AAC GAA CGT ACT TGC AGA TGT GAC 577 
Cys Lys Ala Arg Gin Leu Glu Leu Asn Glu Arg Thr Cys Arg Cys Asp 
175 180 185 

AAG CCG AGG CGG TGAGCCGGGC AGGAGGAAGG AGCCTCCCTC AGCGTTTCGG 629 
Lys Pro Arg Arg 
190 

GAACCAGATC TCTCACCAGG 64 9 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



Met Asn Phe Leu 
1 

Tyr Leu His His 
20 

Gly Gly Gin Asn 
35 

Arg Ser Tyr Cys 
50 

Tyr Pro Asp Glu 
65 

Met Arg Cys Gly 



Thr Glu Glu Ser 
100 

Gin Gly Gin His 
115 



Leu Ser Trp Val 
5 

Ala Lys Trp Ser 



His His Glu Val 
40 

His Pro lie Glu 
55 

He Glu Tyr He 
70 

Gly Cys Cys Asn 
85 

Asn He Thr Met 



He Gly Glu Met 
120 



His Trp Ser Leu 
10 

Gin Ala Ala Pro 
25 

Val Lys Phe Met 



Thr Leu Val Asp 
60 

Phe Lys Pro Ser 
75 

Asp Glu Gly Leu 
90 

Gin He Met Arg 
105 

Ser Phe Leu Gin 



Ala Leu Leu Leu 
15 

Met Ala Glu Gly 
30 

Asp Val Tyr Gin 
45 

He Phe Gin Glu 



Cys Val Pro Leu 
80 

Glu Cys Val Pro 
95 

He Lys Pro His 
110 

His Asn Lys Cys 
125 
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Glu Cys Arg Pro Lys Lys Asp Arg 
130 135 

Pro Cys Ser Glu Arg Arg Lys His 
145 150 

Cys Lys Cys Ser Cys Lys Asn Thr 
165 ■ 

Leu Glu Leu Asn Glu Arg Thr Cys 
180 
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Ala Arg Gin Glu Asn Pro Cys Gly 
140 

Leu Phe Val Gin Asp Pro Gin Thr 
155 160 

Asp Ser Arg Cys Lys Ala Arg Gin 
170 175 

Arg Cys Asp Lys Pro Arg Arg 
185 190 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1094 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3. .624 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 



CC ATG AGC CCT CTG CTC CGC CGC CTG CTG CTC GCC GCA CTC CTG CAG 47 
Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gin 
15 10 15 

CTG GCC CCC GCC CAG GCC CCT GTC TCC CAG CCT GAT GCC CCT GGC CAC 95 
Leu Ala Pro Ala Gin Ala Pro Val Ser Gin Pro Asp Ala Pro Gly His 
20 25 30 

CAG AGG AAA GTG GTG TCA TGG ATA GAT GTG TAT ACT CGC GCT ACC TGC 143 
Gin Arg Lys Val Val Ser Trp lie Asp Val Tyr Thr Arg Ala Thr Cys 
35 40 45 

CAG CCC CGG GAG GTG GTG GTG CCC TTG ACT GTG GAG CTC ATG GGC ACC 191 
Gin Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr 
50 55 60 

GTG GCC AAA CAG CTG GTG CCC AGC TGC GTG ACT GTG CAG CGC TGT GGT 23 9 

Val Ala Lys Gin Leu Val Pro Ser Cys Val Thr Val Gin Arg Cys Gly 
65 70 75 

GGC TGC TGC CCT GAC GAT GGC CTG GAG TGT GTG CCC ACT GGG CAG CAC 287 
Gly Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gin His 
80 85 90 95 
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CAA GTC CGG ATG CAG ATC CTC ATG ATC CGG TAC CCG AGC AGT CAG CTG 335 
Gin Val Arg Met Gin lie Leu Met lie Arg Tyr Pro Ser Ser Gin Leu 
100 105 110 

GGG GAG ATG TCC CTG GAA GAA CAC AGC CAG TGT GAA TGC AGA CCT AAA 3 83 

Gly Glu Met Ser Leu Glu Glu His Ser Gin Cys Glu Cys Arg Pro Lys 
115 120 125 

AAA AAG GAC AGT GCT GTG AAG CCA GAC AGG GCT GCC ACT CCC CAC CAC 431 
Lys Lys Asp Ser Ala Val Lys Pro Asp Arg Ala Ala Thr Pro His His 
130 135 140 

CGT CCC CAG CCC CGT TCT GTT CCG GGC TGG GAC TCT GCC CCC GGA GCA 47 9 

Arg Pro Gin Pro Arg Ser Val Pro Gly Trp Asp Ser Ala Pro Gly Ala 
145 150 155 

CCC TCC CCA GCT GAC ATC ACC CAT CCC ACT CCA GCC CCA GGC CCC TCT 52 7 

Pro Ser Pro Ala Asp lie Thr His Pro Thr Pro Ala Pro Gly Pro Ser 
160 165 170 175 

GCC CAC GCT GCA CCC AGC ACC ACC AGC GCC CTG ACC CCC GGA CCT GCC 575 
Ala His Ala Ala Pro Ser Thr Thr Ser Ala Leu Thr Pro Gly Pro Ala 
180 185 190 

GCT GCC GCT GCC GAC GCC GCA GCT TCC TCC GTT GCC AAG GGC GGG GCT T 624 
Ala Ala Ala Ala Asp Ala Ala Ala Ser Ser Val Ala Lys Gly Gly Ala 
195 200 205 

AGAGCTCAAC CCAGACACCT GCAGGTGCCG GAAGCTGCGA AGGTGACACA TGGCTTTTCA 684 

GACTCAGCAG GGTGACTTGC CTCAGAGGCT AT ATC CCAGT GGGGGAACAA AGGGGAGCCT 744 

GGTAAAAAAC AGCCAAGCCC CCAAGACCTC AGCCCAGGCA GAAGCTGCTC TAGGACCTGG 804 

GCCTCTCAGA GGGCTCTTCT GCCATCCCTT GTCTCCCTGA GGCCATCATC AAACAGGACA 864 

GAGTTGGAAG AGGAGACTGG GAGGCAGCAA GAGGGGTCAC ATACCAGCTC AGGGGAGAAT 924 

GGAGTACTGT CTCAGTTTCT AACCACTCTG TGCAAGTAAG CATCTTACAA CTGGCTCTTC 984 

CTCCCCTCAC TAAGAAGACC CAAACCTCTG CATAATGGGA TTTGGGCTTT GGTACAAGAA 1044 

CTGTGACCCC CAACCCTGAT AAAAGAGATG GAAGGAAAAA AAAAAAAAAA 1094 
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(2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 07 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gin Leu 
1 5 10 15 

Ala Pro Ala Gin Ala Pro Val Ser Gin Pro Asp Ala Pro Gly His Gin 
20 25 30 

Arg Lys Val Val Ser Trp lie Asp Val Tyr Thr Arg Ala Thr Cys Gin 
35 40 45 

Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr Val 
50 55 60 

Ala Lys Gin Leu Val Pro Ser Cys Val Thr Val Gin Arg Cys Gly Gly 
65 70 75 80 



Cys Cys Pro Asp 



Val Arg Met Gin 
100 

Glu Met Ser Leu 
115 

Lys Asp -Ser Ala 
130 

Pro Gin Pro Arg 
145 

Ser Pro Ala Asp 



His Ala Ala Pro 
180 

Ala Ala Ala Asp 
195 



Asp Gly Leu Glu 
85 

lie Leu Met lie 



Glu Glu His Ser 
120 

Val Lys Pro Asp 
135 

Ser Val Pro Gly 
150 

lie Thr His Pro 
165 

Ser Thr Thr Ser 



Ala Ala Ala Ser 
200 



Cys Val Pro Thr 
90 

Arg Tyr Pro Ser 
105 

Gin Cys Glu Cys 



Arg Ala Ala Thr 
140 

Trp Asp Ser Ala 
155 

Thr Pro Ala Pro 
170 

Ala Leu Thr Pro 
185 

Ser Val Ala Lys 



Gly Gin His Gin 
95 

Ser Gin Leu Gly 
110 

Arg Pro Lys Lys 
125 

Pro His His Arg 



Pro Gly Ala Pro 
160 

Gly Pro Ser Ala 
175 

Gly Pro Ala Ala 
190 

Gly Gly Ala 
205 
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(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 993 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 3 . . 566 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



CC ATG AGC CCT CTG CTC CGC CGC CTG CTG CTC GCC GCA CTC CTG CAG 47 
Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gin 
1 5 10 .15 

CTG GCC CCC GCC CAG GCC CCT GTC TCC CAG CCT GAT GCC CCT GGC CAC 95 
Leu Ala Pro Ala Gin Ala Pro Val Ser Gin Pro Asp Ala Pro Gly His 
20 25 30 

CAG AGG AAA GTG GTG TCA TGG ATA GAT GTG TAT ACT CGC GCT ACC TGC 143 
Gin Arg Lys Val Val Ser Trp lie Asp Val Tyr Thr Arg Ala Thr Cys 
35 40 45 

CAG CCC CGG GAG GTG GTG GTG CCC TTG ACT GTG GAG CTC ATG GGC ACC 191 
Gin Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thx 
50 55 60 

GTG GCC AAA CAG CTG GTG CCC AGC TGC GTG ACT GTG CAG CGC TGT GGT 239 
Val Ala Lys Gin Leu Val Pro Ser Cys Val Thr Val Gin Arg Cys Gly 
65 70 75 

GGC TGC TGC CCT GAC GAT GGC CTG GAG TGT GTG CCC ACT GGG CAG CAC 287 
Gly Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gin His 
80 85 90 95 

CAA GTC CGG ATG CAG ATC CTC ATG ATC CGG TAC CCG AGC AGT CAG CTG 3 35 

Gin Val Arg Met Gin lie Leu Met lie Arg Tyr Pro Ser Ser Gin Leu 
100 105 110 

GGG GAG ATG TCC CTG GAA GAA CAC AGC CAG TGT GAA TGC AGA CCT AAA 383 
Gly Glu Met Ser Leu Glu Glu His Ser Gin Cys Glu Cys Arg Pro Lys 
115 120 125 

AAA AAG GAC AGT GCT GTG AAG CCA GAT AGC CCC AGG CCC CTC TGC CCA 431 
Lys Lys Asp Ser Ala Val Lys Pro Asp Ser Pro Arg Pro Leu Cys Pro 
130 135 140 
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CGC TGC ACC CAG CAC CAC CAG CGC CCT GAC CCC CGG ACC TGC CGC TGC 4 79 

Arg Cys Thr Gin His His Gin Arg Pro Asp Pro Arg Thr Cys Arg Cys 
145 150 155 

CGC TGC CGA CGC CGC AGC TTC CTC CGT TGC CAA GGG CGG GGC TTA GAG 52 7 

Arg Cys Arg Arg Arg Ser Phe Leu Arg Cys Gin Gly Arg Gly Leu Glu 
160 165 170 175 

CTC AAC CCA GAC ACC TGC AGG TGC CGG AAG CTG CGA AGG TGACACATGG 5 76 

Leu Asn Pro Asp Thr Cys Arg Cys Arg Lys Leu Arg Arg 
180 185 

CTTTTCAGAC TCAGCAGGGT GACTTGCCTC AGAGGCTATA TCCCAGTGGG GGAACAAAGG 63 6 

GGAGCCTGGT AAAAAACAGC CAAGCCCCCA AGACCTCAGC CCAGGCAGAA GCTGCTCTAG 6 96 

GACCTGGGCC TCTCAGAGGG CTCTTCTGCC ATCCCTTGTC TCCCTGAGGC CATCATCAAA 756 

CAGGACAGAG TTGGAAGAGG AGACTGGGAG GCAGCAAGAG GGGTCACATA CCAGCTCAGG 816 

GGAGAATGGA GTACTGTCTC AGTTTCTAAC CACTCTGTGC AAGTAAGCAT CTTACAACTG 8 76 

GCTCTTCCTC CCCTCACTAA GAAGACCCAA ACCTCTGCAT AATGGGATTT GGGCTTTGGT 936 

ACAAGAACTG TGACCCCCAA CCCTGATAAA AGAGATGGAA GGAAAAAAAA AAAAAAA 9 93 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 
- 4 (A) LENGTH: 18 8 "amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 



Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gin Leu 
15 10 15 

Ala Pro Ala Gin Ala Pro Val Ser Gin Pro Asp Ala Pro Gly His Gin 
20 25 30 

Arg Lys Val Val Ser Trp lie Asp Val Tyr Thr Arg Ala Thr Cys Gin 
35 40 45 

Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr Val 
50 55 60 

Ala Lys Gin Leu Val Pro Ser Cys Val Thr Val Gin Arg Cys Gly Gly 
65 70 75 80 
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Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gin His Gin 
85 90 95 

Val Arg Met Gin lie Leu Met lie Arg Tyr Pro Ser Ser Gin Leu Gly 
100 105 110 

Glu Met Ser Leu Glu Glu His Ser Gin Cys Glu Cys Arg Pro Lys Lys 
115 120 125 

Lys Asp Ser Ala Val Lys Pro Asp Ser Pro Arg Pro Leu Cys Pro Arg 
130 135 140 

Cys Thr Gin His His Gin Arg Pro Asp Pro Arg Thr Cys Arg Cys Arg 
145 150 155 160 

Cys Arg Arg Arg Ser Phe Leu Arg Cys Gin Gly Arg Gly Leu Glu Leu 
165 170 175 

Asn Pro Asp Thr Cys Arg Cys Arg Lys Leu Arg Arg 
180 185 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 858 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.. 431 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 



CC ATG AGC CCT CTG CTC CGC CGC CTG CTG CTC GCC GCA CTC CTG CAG 47 
Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gin 
15 10 15 

CTG GCC CCC GCC CAG GCC CCT GTC TCC CAG CCT GAT GCC CCT GGC CAC 95 
Leu Ala Pro Ala Gin Ala Pro Val Ser Gin Pro Asp Ala Pro Gly His 
20 25 30 

CAG AGG AAA GTG GTG TCA TGG ATA GAT GTG TAT ACT CGC GCT ACC TGC 143 
Gin Arg Lys Val Val Ser Trp lie Asp Val Tyr Thr Arg Ala Thr Cys 
35 40 45 

CAG CCC CGG GAG GTG GTG GTG CCC TTG ACT GTG GAG CTC ATG GGC ACC 191 
Gin Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr 
50 55 60 
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GTG GCC AAA CAG CTG GTG CCC AGC TGC GTG ACT GTG CAG CGC TGT GGT 23 9 

Val Ala Lys Gin Leu Val Pro Ser Cys Val Thr Val Gin Arg Cys Gly 
65 70 75 

GGC TGC TGC CCT GAC GAT GGC CTG GAG TGT GTG CCC ACT GGG CAG CAC 28 7 

Gly Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gin His 
80 85 90 95 

CAA GTC CGG ATG CAG ATC CTC ATG ATC CGG TAC CCG AGC AGT CAG CTG 33 5 

Gin Val Arg Met Gin lie Leu Met lie Arg Tyr Pro Ser Ser Gin Leu 
100 105 110 

GGG GAG ATG TCC CTG GAA GAA CAC AGC CAG TGT GAA TGC AGA CCT AAA 383 
Gly Glu Met Ser Leu Glu Glu His Ser Gin Cys Glu Cys Arg Pro Lys 
115 120 125 

AAA AAG GAC AGT GCT GTG AAG CCA GAT AGG TGC CGG AAG CTG CGA AGG 431 
Lys Lys Asp Ser Ala Val Lys Pro Asp Arg Cys Arg Lys Leu Arg Arg 
130 135 140 

TGACACATGG CTTTTCAGAC TCAGCAGGGT GACTTGCCTC AGAGGCTATA TCCCAGTGGG 4 91 

GGAACAAAGG GGAGCCTGGT AAAAAACAGC CAAGCCCCCA AGACCTCAGC CCAGGCAGAA 551 

GCTGCTCTAG GACCTGGGCC TCTCAGAGGG CTCTTCTGCC ATCCCTTGTC TCCCTGAGGC 611 

CATCATCAAA CAGGACAGAG TTGGAAGAGG AGACTGGGAG GCAG CAAGAG GGGTCACATA 671 

CCAGCTCAGG GGAGAATGGA GTACTGTCTC AGTTTCTAAC CACTCTGTGC AAGTAAGCAT 731 

CTTACAACTG GCTCTTCCTC CCCTCACTAA GAAGACCCAA ACCTCTGCAT AATGGGATTT 791 

GGGCTTTGGT ACAAGAACTG TGACCCCCAA CCCTGATAAA AGAGATGGAA GGAAAAAAAA 851 

AAAAAAA 858 
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(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



Met Ser Pro Leu 
1 

Ala Pro Ala Gin 
20 

Arg Lys Val Val 
35 

Pro Arg Glu Val 
50 

Ala Lys Gin Leu 
65 

Cys Cys Pro Asp 



Val Arg Met Gin 
100 

Glu Met Ser Leu 
115 

Lys Asp Ser Ala 
130 



Leu Arg Arg Leu 
5 

Ala Pro Val Ser 



Ser Trp lie Asp 
40 

Val Val Pro Leu 
55 

Val Pro Ser Cys 
70 

Asp Gly Leu Glu 
85 

lie Leu Met lie 



Glu Glu His Ser 
12 0 

Val Lys Pro Asp 
135 



Leu Leu Ala Ala 
10 

Gin Pro Asp Ala 

25 

Val Tyr Thr Arg 



Thr Val Glu Leu 
60 

Val Thr Val Gin 
75 

Cys Val Pro Thr 
90 

Arg Tyr Pro Ser 
105 

Gin Cys Glu Cys 



Arg Cys Arg Lys 
140 



Leu Leu Gin Leu 
15 

Pro Gly His Gin 
30 

Ala Thr Cys Gin 
45 

Met Gly Thr Val 



Arg Cys Gly Gly 
80 

Gly Gin His Gin 
95 

Ser Gin Leu Giy 
110 

Arg Pro Lys Lys 
125 

Leu Arg Arg 
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(2) INFORMATION FOR SEQ ID NO : 9 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 3.. 305 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

CC ATG AGC CCT CTG CTC CGC CGC CTG CTG CTC GCC GCA CTC CTG CAG 4 7 

Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gin 
15 10 15 

CTG GCC CCC GCC CAG GCC CCT GTC TCC CAG CCT GAT GCC CCT GGC CAC 95 
Leu Ala Pro Ala Gin Ala Pro Val Ser Gin Pro Asp Ala Pro Gly His 
20 25 30 

CAG AGG AAA GTG GTG TCA TGG ATA GAT GTG TAT ACT CGC GCT ACC TGC 143 
Gin Arg Lys Val Val Ser Trp lie Asp Val Tyr Thr Arg Ala Thr Cys 
35 40 45 

CAG CCC CGG GAG GTG GTG GTG CCC TTG ACT GTG GAG CTC ATG GGC ACC 191 
Gin Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr 
50 55 60 

GTG GCC AAA CAG CTG GTG CCC AGC TGC GTG ACT GTG CAG CGC TGT GGT 23 9 

Val Ala Lys Gin Leu Val Pro Ser Cys Val Thr Val Gin Arg Cys Gly 
65 70 75 

GGC TGC TGC CCT GAC GAT GGC CTG GAG TGT GTG CCC ACT GGG CAG CAC 2 87 

Gly Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gin His 
80 85 90 95 

CAA GTC CGG ATG CAG ACC TAAAAAAAAG GACAGTG CTG TGAAGCCAGA 3 35 
Gin Val Arg Met Gin Thr 
100 

CAGGGCTGCC ACTCCCCACC ACCGTCCCCA GCCCCGTTCT GTTCCGGGCT GGGACTCTGC 3 95 

CCCCGGAGCA CCCTCCCCAG CTGACATCAC CCATCCCACT CCAGCCCCAG GCCCCTCTGC 455 

CCACGCTGCA CCCAGCACCA CCAGCGCCCT GACCCCCGGA CCTGCCGCTG CCGCTGCCGA 515 

CGCCGCAGCT TCCTCCGTTG CCAAGGGCGG GGCTTAGAGC TCAACCCAGA CACCTGCAGG 575 

TGC CGGAAGC TGCGAAGGTG ACACATGGCT TTTCAGACTC AGCAGGGTGA CTTGCCTCAG 635 
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AGGCTATATC CCAGTGGGGA ACAAAGAGGA GCCTGGTAAA AAACAGCCAA GCCCCCAAGA 6 95 

CCTCAGCCCA GGCAGAAGCT GCTCTAGGAC CTGGGCCTCT CAGAGGGCTC TTCTGCCATC 75 5 

CCTTGTCTCC CTGAGGCCAT CATCAAACAG GACAGAGTTG GAAGAGGAGA CTGGGAGGCA 815 

GCAAGAGGGG TCACATACCA GCTCAGGGGA GAATGGAGTA CTGTCTCAGT TTCTAACCAC 8 75 

TCTGTGCAAG TAAGCATCTT ACAACTGGCT CTTCC 910 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gin Leu 
15 10 15 

Ala Pro Ala Gin Ala Pro Val Ser Gin Pro Asp Ala Pro Gly His Gin 
20 25. 30 

Arg Lys Val Val Ser Trp lie Asp Val Tyr Thr Arg Ala Thr Cys Gin 
35 40 45 

Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr Val 
50 55 60 

Ala Lys Gin Leu Val Pro Ser Cys Val Thr Val Gin Arg Cys Gly Gly 
65 70 75 80 

Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gin His Gin 
85 90 95 

Val Arg Met Gin Thr 
100 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



ACCACCACCT CCCTGGGCTG GCATGTGGCA CGTGCATAAA CG 42 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 



AGTTGTTTGA CCACATTGCC CATGAGTTCC ATGCTCAGAG GC 42 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



GATCCTGGGG CTGGAGTGGG ATGGATGATG TCAGCTGG 38 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 40 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



GCGGGCAGAG GATCCTGGGG CTGTCTGGCC TCACAGCACT 4 0 



DATED this 22nd day of December, 1995 

THE COUNCIL OF THE QUEENSLAND INSTITUTE OF MEDICAL RESEARCH 
and AMRAD CORPORATION LIMITED 

By Its Patent Attorneys 

DAVIES COLLISON CAVE 
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FIGURE 1 

TCGGGCCTCC GAAACC ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC 4 9 

Met Asn Phe Leu Leu Ser Trp Val His Trp Ser 
15 10 

CTT GCC TTG CTG CTC TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA 97 
Leu Ala Leu Leu Leu Tyr Leu His His Ala Lys Trp Ser Gin Ala Ala 
15 20 25 

CCC ATG GCA GAA GGA GGA GGG CAG AAT CAT CAC GAA GTG GTG AAG TTC 14 5 

Pro Met Ala Glu Gly Gly Gly Gin Asn His His Glu Val Val Lys Phe 
30 35 40 

ATG GAT GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC GAG ACC CTG GTG 193 
Met Asp Val Tyr Gin Arg Ser Tyr Cys His Pro lie Glu Thr Leu Val 
45 50 55 

GAC ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC ATC TTC AAG CCA 241 
Asp lie Phe Gin Glu Tyr Pro Asp Glu lie Glu Tyr lie Phe Lys Pro 
60 65 70 75 

TCC TGT GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC AAT GAC GAG GGC 2 89 

Ser Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly 
80 85 90 

CTG GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG 3 37 

Leu Glu Cys Val Pro Thr Glu Glu Ser Asn lie Thr Met Gin lie Met 
95 100 105 

CGG ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA 3 85 

Arg lie Lys Pro His Gin Gly Gin His lie Gly Glu Met Ser Phe Leu 
110 115 120 

CAG CAC AAC AAA TGT GAA TGC AGA CCA AAG AAA GAT AGA GCA AGA CAA 433 
Gin His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gin 
125 130 135 

GAA AAT CCC TGT GGG CCT TGC TCA GAG CGG AGA AAG CAT TTG TTT GTA 4 81 

Glu Asn Pro Cys Gly Pro Cys Ser Glu Arg Arg Lys His Leu Phe Val 
140 145 150 155 

CAA GAT CCG CAG ACG TGT AAA TGT TCC TGC AAA AAC ACA GAC TCG CGT 529 
Gin Asp Pro Gin Thr Cys Lys Cys Ser Cys Lys Asn Thr Asp Ser Arg 
160 165 170 

TGC AAG GCG AGG CAG CTT GAG TTA AAC GAA CGT ACT TGC AGA TGT GAC 5 77 

Cys Lys Ala Arg Gin Leu Glu Leu Asn Glu Arg Thr Cys Arg Cys Asp 
175 180 185 

AAG CCG AGG CGG TGAGCCGGGC AGGAGGAAGG AGCCTCCCTC AGCGTTTCGG 62 9 
Lys Pro Arg Arg 
190 

GAACCAGATC TCTCACCAGG 649 
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FIGURE 2 

CC ATG AGC CCT CTG CTC CGC CGC CTG CTG CTC GCC GCA CTC CTG CAG 47 
Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu. Gin 
15 10 15 

CTG GCC CCC GCC CAG GCC CCT GTC TCC CAG CCT GAT GCC CCT GGC CAC 95 
Leu Ala Pro Ala Gin Ala Pro Val Ser Gin Pro Asp Ala Pro Gly His 
20 25 30 

CAG AGG AAA GTG GTG TCA TGG ATA GAT GTG TAT ACT CGC GCT ACC TGC 143 
Gin Arg Lys Val Val Ser Trp lie Asp Val Tyr Thr Arg Ala Thr Cys 
35 40 45 

CAG CCC CGG GAG GTG GTG GTG CCC TTG ACT GTG GAG CTC ATG GGC ACC 191 
Gin Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr 
50 55 60 

GTG GCC AAA CAG CTG GTG CCC AGC TGC GTG ACT GTG CAG CGC TGT GGT 23 9 

Val Ala Lys Gin Leu Val Pro Ser Cys Val Thr Val Gin Arg Cys Gly 
65 70 75 

GGC TGC TGC CCT GAC GAT GGC CTG GAG TGT GTG CCC ACT GGG CAG CAC 287 
Gly Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gin His 
80 85 90 95 

CAA GTC CGG ATG CAG ATC CTC ATG ATC CGG TAC CCG AGC AGT CAG CTG 335 
Gin Val Arg Met Gin lie Leu Met lie Arg Tyr Pro Ser Ser Gin Leu 
100 105 110 

GGG GAG ATG TCC CTG GAA GAA CAC AGC CAG TGT GAA TGC AGA CCT AAA 3 83 

Gly Glu Met Ser Leu Glu Glu His Ser Gin Cys Glu Cys Arg Pro Lys 
115 120 125 

AAA AAG GAC AGT GCT GTG AAG CCA GAC AGG GCT GCC ACT CCC CAC CAC 431 
Lys Lys Asp Ser Ala Val Lys Pro Asp Arg Ala Ala Thr Pro His His 
130 135 140 

CGT CCC CAG CCC CGT TCT GTT CCG GGC TGG GAC TCT GCC CCC GGA GCA 4 79 

Arg Pro Gin Pro Arg Ser Val Pro Gly Trp Asp Ser Ala Pro Gly Ala 
145 150 155 

CCC TCC CCA GCT GAC ATC ACC CAT CCC ACT CCA GCC CCA GGC CCC TCT 527 
Pro Ser Pro Ala Asp lie Thr His Pro Thr Pro Ala Pro Gly Pro Ser 
160 165 170 175 

GCC CAC GCT GCA CCC AGC ACC ACC AGC GCC CTG ACC CCC GGA CCT GCC 575 
Ala His Ala Ala Pro Ser Thr Thr Ser Ala Leu Thr Pro Gly Pro Ala 
180 185 190 

GCT GCC GCT GCC GAC GCC GCA GCT TCC TCC GTT GCC AAG GGC GGG GCT T 624 
Ala Ala Ala Ala Asp Ala Ala Ala Ser Ser Val Ala Lys Gly Gly Ala 
195 200 205 
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FIGURE 2 (continued...) 

AGAGCTCAAC CCAGACACCT GCAGGTGCCG GAAGCTGCGA AGGTGACACA TGGCTTTTCA 684 

GACTCAGCAG GGTGACTTGC CTCAGAGGCT ATATCCCAGT GGGGGAACAA AGGGGAGCCT 744 

GGTAAAAAAC AGCCAAGCCC CCAAGACCTC AGCCCAGGCA GAAGCTGCTC TAGGACCTGG 804 

GCCTCTCAGA GGGCTCTTCT GCCATCCCTT GTCTCCCTGA GGCCATCATC AAACAGGACA 864 

GAGTTGGAAG AGGAGACTGG GAGGCAGCAA GAGGGGTCAC ATACCAGCTC AGGGGAGAAT 924 

GGAGTACTGT CTCAGTTTCT AACCACTCTG TGCAAGTAAG CATCTTACAA CTGGCTCTTC 984 

CTCCCCTCAC TAAGAAGACC CAAACCTCTG CATAATGGGA TTTGGGCTTT GGTACAAGAA 1044 

CTGTGACCCC CAACCCTGAT AAAAGAGATG GAAGGAAAAA AAAAAAAAAA 1094 
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FIGURE 3 



> V£G:_r T JMAN V E G F_HUMA2<' VASCULAR E>7I;OTHZL-AL GROWTH FACTOR PRECURSOR (^Gr) 
(VASCULAR 215 AA . 
Length = 215 

Score = 181 (92.4 bits}, Expect = 6.4e-20, ? = 6 . 4e-20 
Identities = 33/75 (44%) , Positives = 43/75 (54%) 

Query: 31 HQRKWSWIDVYTRATCQPREVWPL^ 90 

+ W -*-DVY R+ C + P E +V + E - PSCV + RCGGCC D+GLECV 

Sb^ct: 36 NKHEV-vT<FMDV^QRSY 95 

Query: 91 PTGQKQVRMQILMIR 105 

PT + + MQI- I-r 
Sbjct: 96 PTEESNITMQIMRIK 110 

Score = 76 (38.8 bits), Expect = 0.0011,. Poisson ?(2) = S.le-12 
Identities = 12/19 (63%) , Positives = 16/19 (84%) 

Query: 110 QLGEMSLEEKSQCECRPKX 128 

t+GEMS +H+ CECRPKX 
Sbjct: 116 HIGEMSFLQHNXCECRPKX 134 

Score = 72 (36.8 bits), Expect = 0.0046, Poisson ?(3) = 3.6e-18 
Identities = 14/21 (66%) , Positives = 15/21 (71%) 

Query: 2 02 RCQGRGLELNPDTCRCRKLRR 22 2 

RC +R LELN TCRC K RR 
SbjCTl : 155 RCKAR.CLZLNEF.TCRCDKPRR 2 _ z 

Score = 46 (23.5 bits), Expect =47., Poisson ?(4) = 7.3e-10 
Identities = 6/10 (60%), Positives = 9/10 (90%) 



Query 
3b j ct 



13 7 DPRTCRCRCR 196 

DP+TC + C C-r 
131 DPQTCKCSCX 19 0 



FIGURE 4 



Gap Weight: 3.000 Average Match: 1.U00 
Length V, eight: 0.100 ?rage Mismatch: -0.900 

Length: 739 
Gaps: 30 



Quality: 100.9 
Ratio: 0.175 



Percent Similarity: 69.703 Percent Identity: 69.703 
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FIGURE 5 



165SOMSQ.MSF.msf MSF: 687 Type: D Tuesday, June 20, 1995 Check: 3140 

1 80 
VEGF16 5 ATGAACTTTCTGCTGTCTTGGGTGCATTGGAGCCTTGCCTTGCTGCTCTACCTCCACCATGCCAAGTGGTCCCAGGCTG . 
SOM175 ATGAGCCCTCTGCTCCGCCGCCTGCTGCTCGCCGCACTCCTGCAGCTGGCCCCCGCCCAGGCCCCTGTCTCCCAGCCTGA 
SOM175-e6 ATGAGCCCTCTGCTCCGCCGCCTGCTGCTCGCCGCACTCCTGCAGCTGGCCCCCGCCCAGGCCCCTGTCTCCCAGCCTGA 
SOM175-e6&7 ATGAGCCCTCTGCTCCGCCGCCTGCTGCTCGCCGCACTCCTGCAGCTGGCCCCCGCCCAGGCCCCTGTCTCCCAGCCTGA 
SOM175-e4 ATGAGCCCTCTGCTCCGCCGCCTGCTGCTCGCCGCACTCCTGCAGCTGGCCCCCGCCCAGGCCCCTGTCTCCCAGCCTGA 



81 



160 



VEGF1 6 5 C AC C CATGG CAG AAGG AGG AG GG CAG AAT C AT CACG AAG TGG TG AAG TT CATGG ATG T CT AT CAG CG CAG CT ACTG C CAT 

SOM1 7 5 TGCCCCTGGC C AC CAG AGG AAAG TGGTG T C ATG GAT AG ATG TG T AT ACT CG CG CTACCTGC . CAGCC . CCGGGAG 

SOM1 7 5 -e6 TGCCCCTGGCCACCAGAGGAAAGTGGTGTCATGGATAGATGTGTATACTCGCG CTACCTGC . CAGCC . CCGGGAG 

SOM17 5 -e6&7 TGCCCCTGGCCACCAGAGGAAAGTGGTGTCATGGATAGATGTGTATACTCGCG CTACCTGC . CAGCC . CCGGGAG 

SOM175 -e4 TGCCCCTGGCCACCAGAGGAAAGTGGTGTCATGGATAGATGTGTATACTCGCG CTACCTGC . CAGCC . CCGGGAG 



VEGF165 
SOM175 
SOM175-e6 
SOM175-e6&7 
SOM175-e4 



161 



240 



CCAATCGAGACCCTGGTGGACATCTTCCAGGAGTACCCTGATGAGATCGAGTACATCTTCAAGCCATCCTGTGTGCCCCT 

GTGGTGGTGCCCTTGACTG . TGGAGCTCATGGGCACCGTGGCCAAAC . . AGCTGGTGCCCAG CTGCGTGACTGT 

GTGGTGGTGCCCTTGACTG . TGGAGCTCATGGGCACCGTGGCCAAAC . . AGCTGGTGCCCAG CTGCGTGACTGT 

GTGGTGGTGCCCTTGACTG . TGGAGCTCATGGGCACCGTGGCCAAAC . . AGCTGGTGCCCAG CTGCGTGACTGT 

GTGGTGGTGCCCTTGACTG . TGGAGCTCATGGGCACCGTGGCCAAAC . . AGCTGGTGCCCAG CTGCGTGACTGT 



241 



320 



VEGF1 6 5 GATGCGATGCGGGGGCTGCTGCAATGACGAGGGCCTGGAGTGTGTGCCCACTGAGGAGTCCAACATCACCATGCAGATTA 

SOM1 7 5 GCAGCGCTGTGGTGGCTGCTGCCCTGACGATGGCCTGGAGTGTGTGCCCACTGGGCAGCACCAAGTCCGGATGCAGATCC 

SOM175-e6 G CAG CG CTG TGGTGG CTG CTG C C CTG ACG ATGG CCTGG AG TG TG TG C C CACTGGG CAG CAC CAAG T C CGG ATG CAG AT C C 

SOM1 7 5 - e 6 Scl GCAGCGCTGTGGTGGCTGCTGCCCTGACGATGGCCTGGAGTGTGTGCCCACTGGGCAGCACCAAGTCCGGATGCAGATCC 

SOM175 - e4 G CAG CGCTGTGGTGGCTGCTGCCCTGACGATGG CCTGG AGTGTGTGCCCACTGGGCAGCACCAAGTCCGGATGCAGA. . . 



VEGF16 5 
SOM175 
SOM175-e6 
SOMl75-e6Sc7 
SOM175-e4 



321 



400 



TGCGGATC^^CCTCACCAAGGCCAGCAC^TAGGAGAGATGAGCTTCCTACAGCACAACAAATGTGAATGCAGACC . . . A 
TCATGATCCGG . . . T AC C C G AG CAG T CAG C TGGGGG AG ATG T C C C TGG AAG AACACAG C CAGTG TG AATG CAG AC C T AAA 
T CAT GAT C CGG . . . T AC C CG AG CAG T CAG CTGG GGG AG ATGT C C CTGG AAG AA CACAG C CAG TGTG AATG CAG AC C T AAA 
TCATGATCCGG . . - TACCCGAGCAGTCAGCTGGGGGAGATGTCCCTGGAAGAACACAGCCAGTGTGAATGCAGACCTAAA 

CCTAAA 



VEGF16 5 
SOM175 
SOM175-e6 
SOM175-e6&7 
SOM175-e4 



401 480 

AAGAAAGATAG AGCAAGACAAGAA . . . . AATCCCTGTGG G C CTTG CT CAG AG CGG AG A 

AAAAAGGACAGTGCTGTGAAGCCAGACAGGGCTGCCACTCCCCACCAcCGTCCCCAGCCCCGTTCTGTTCCGGGCTGGGA 

AAAAAGG ACAGTG CTG TG AAG C CAG AT AG 

AAAAAGGACAGTGCTGTGAAGCCAGATAG 

AAAAAGGACAGTGCTGTGAAGCCAGACAGGGCTGCCACTCCCCACCACCGTCCCCAGCCCCGTTCTGTTCCGGGCTGGGA 



VEGF16 5 
SOM175 
SOM175-e6 
SOM175-e6&7 
SOM175-e4 



VEGF165 
SOM175 
SOM175-e6 
SOM175-e6&7 
SOM175-e4 



481 



560 



AAGCA TTTGTT TGTAC . .A 

CTCTGCCCCCGGAGCACCCTCCCCAGCTGACATCACCCATCCCACTCCAGCCCCAGGCCCCTCTGCCCACGCTGOVCCCA 
CCCCAGGCCCCTCTGCCCACGCTGCACCCA 



CTCTGCCCCCGGAGCACCCTCCCCAGCTGACATCACCCATCCCACTCCAGCCCCAGGCCCCTCTGCCCACGCTGCACCCA 



561 



640 

A GATCCGCAGACGTGTAAATGTTCCTGCAAAAAC . ACAGACTCG . . CGTTG CAAGG CG AGG CAG C 

GCACCACCAGCGCCCTGACCCCCGGACCTGCCGCTGCCGCTGCCGACGCCGCAGCTTCCTCCGTTGCCAAGGGCGGGGCT 
GCACCACCAGCGCCCTGACCCCCGGACCTGCCGCTGCCGCTGCCGACGCCGCAGCTTCCTCCGTTGCCAAGGGCGGGGCT 

GCACCACCAGCGCCCTGACCCCCGGACCTGCCGCTGCCGCTGCCGACGCCGCAGCTTCCTCCGTTGCCAAGGGCGGGGCT 



641 

VEGF1 6 5 TTGAGTTAAACGAACGTACTTGCAGATGTGACAAGCCGAGGCGGTGA 
SOM1 7 5 TAGAGCTCAACCCAGACACCTGCAGGTGCCGGAAGCTGCGAAGGTGA 
SOM1 7 5 - e 6 TAGAGCTCAACCCAGACACCTGCAGGTGCCGGAAGCTGCGAAGGTGA 

SOM1 7 5 - e 6 &7 GTGCCGGAAGCTGCGAAGGTGA 

SOM175-e4 T AGAG C T CAAC C CAG A CAC CTGCAGGTG C CGG AAG CTG CG AAGGTG A 



687 



FIGURE 6 



VEGF I65 
SOM175 shon 

VEGF 16S 
SOM175 sho „ 

VEGF 1M 
S0M175 sll011 

VEGF I65 
SOMI75 shon 
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K PKQGQH IjGE 
R . YPSSQUC? 



FLQHN KjCECRPjCk 



DRAl 



RQENpCQ^CSERRKfcjLF . VGDP 

thptpapgIf 



lee^sc £e£r|^ 



jOT 
ISA 



ckcsckntdsrckarglelnertcrcd53prr 
haapsttsaltpgpaaaaadaaassv a;k|g g a 



6C 
55 

12" 

i 15 

170 
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19 ■ 
207 



or... 

VEGF 165 
SOM175 lon| . 

VEGF 165 
SOM175 lomi 



VEGF I65 
SOM175 lon|i 



!fl| 



NFtCl 



SW V HW51?SlL3 ySh k2 K WS 0 a[a3m A EGGGGw3hE .^KFfiD^GR 
RRLL . -|L^Ajli|G|APgG ^ V SQPD APG^QRK^SW lj^T| 



SYg^IJ 



TL | D 
VVjVjP 



IFQlYPDE IEY IFKjESC^ 
L T V:BL MG T V AK QL viRSiai! 



P L MfiLGSEB nBe GLHilVETlE ESN I iRQ^MRfljl 
T V QjBiIGS^P|D|D BligfiVgfjG 0 H 0 V RfcffjjL Mjl 



mr|i|kphggghi^_ 

R . YPSSOLPEff 



.R-CEN? 



F L Q H N k!cE£RPK^ . DRAT . 

L E E H 5 dCE£RPffljK D S A VK P PR^A.T P HHRp Q p R S V P G WD S A ? G AP SP AD I T HP T P AP G PL 



60 
55 

12: 
1 15 



170 
177 



VEGF„ 5 G R G S E R R K H L r V QD^Q jFckQs CK N T D S . R CjK a|^ 0;L : E L|^E R f^^D^PiR R 

SOM175 lonfi PRCjTOHHCR . . . Pb^R:TC]R ! gRCR RR SFL^QgGj^PDjCG^gLgR 



19 ! 
222 



Areas of 100% homology are boxed and conserved residues thought to be involved in homodimensanon are 
underlined. The VEGF sequence depicted includes the 26 amino acid leader sequence (removal of which 
gives rise to mature VEGF I65 ) giving a total length of 191 amino acids. 

Homoloav of SOM175 to VEGF 16 , is 27% (33%) at the protein level, however within this are blocks of 
100% homology. In particular, many structural residues are conserved including those thought to be 
involved in homodimerisation of VEGF (by comparison with PDGF). 
ie. Cysteine-47 

Proline-70, Cysteine-72. Valine-74 

Arginine-77, Cysteine-78. Glycine-80, Cysteines-81 & 82 
Cysteine-89, Proline-91 
Cysteines 122 & 124 
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FIGURE 8A 




FIGURE 8B 



5'UTR... ATGAGG *Exon 1 (60bp) 

tctcccacag GCCCCT Exon 2 (4 3bp) 

tctgctccca TGGTGT Exon 3 187bp) 

ctgaatacag ATCCTC Exon 4 (73bp) 

acttttcaag ACCTAA Exon 5 (34bp) 

ctcctccgta GGCTGC Exon 6 (lOlbp) 

cccactccag CCCCAG Exon 7 (109bp) 

ccctgctcag GTGCCG *Exon 8 (22bp) 



GGCCAG 
GGAAAG 
ATGCAG 
ATGCAG 
AGACAG 
CTCCAG 
ACCCAG 
AGGTGA 



gtacg ^.gagg 
aatacttaca 
gtccgagctg 
gtgtcaggca 
gtgagtcttt 
ccccaggccc 
acacctgtag 
. . . 3 ' UTR 



FIGURE 9 



gcacaagctcaggccgccgctccggcgccccczzgcgccccctgcgcccagggctcggga 
-103 gggggccgcggaggagccgccccctgcgccccgccccggg^ccccgggt ccgcgccatgg 
-43 ggcggctctggccgacccccccccacaccgcccggctagggcccgATGAGCCCCCTGCTG 

M S ? L L -I'/ 

16 cgtcgcctgctgcttgttgcactgctgcagctggctcgcacccaJgcccctgtgtcccag 

rrlll'vallqlartqapvsq 4 

7 6 tttgatggccccagtcaccagaagaaaJtggtgccatggatagacgtttatgcacgtgcc 

fdgpshqkkvvpwidvyara 24 

13 6 acatgcc agcccagggaggtggtggtgcctctgagcatggaactcatgggcaatgtggtc 

tcqprevvvplsmelmgnvv 44 

19 6 aaacaactagtgcccagctgtgtgactgtgcagcgctgtggtggctgctgccctgacgat 

kqlvpscvtvqrcggccpdd 64 

2 5 6 ggcctggaatgtgtgcccactgggcaacacc aagtccgaatgcac^tcctcatgatccag 

glecvptgqhqvrmqilmiq 84 

316 TACCCGAGC AGTCAGCTGGGGGAGATGTCCCTGGGAGAACACAGCCAATGTGAATGCAC^. 

YPSSQLGEMSLGEHSQCECR 104 

f 

3 7 6 CCTAAAAAAAAGGAGAGTGCTGTGAGGCCAGACAG GGTTGCCATACCCCACCACCGTCCC 

PKKKESAVRPDRVAI PHHRP 124 

43 6 CAGCCCCGCTCTGTTCCGGGCTGGGACTC TACCCCGGGAGCACCCTCCCCAGCTGACATC 

QPRSVPGWDSTPGAPSPADI 144 

4 9 6 ATCCATCCCACTCCAC^ CCCAGGATCCTCTGCCCGCCTTGCACCCAGCGCCGCCAACGCC 

IHPTPAPGSSARLAPSAAN-A 164 
SPRILCPPCTQRRQR 130 

5 5 6 CTGACCCCCGGACCTGCCGTTGCCGCTGTAGACGCCGCCGCTTCCTCCATTGCC AAGGGC 

LT PG PAVAAVDAAAS S I A KG 184 
PDPRTCRCRCRRRRFLHCQG 150 

616 GGGGCT TAG AGCTCAACCCAGACACCTGTAC^STGCCGGAAGCCGCGAAAGTGAcaagctg 

G A * 186 
RGLELNPDTCRCRKPRK* 167 

67 6 ctttccagactccacgggcccggctgctt t tatggccctgct tcacagggagaagagtgg 

73 6 agcacaggcgtaacctcctcagtctgggaggtcactgccccaggacctggaccttttaga 

7 96 gagctctctcgccatcttttatctcccagagctgccatctaacaattgtcaaggaacctc 

85 6 atgtctcacctcaggggccagggtactctctcacttaaccaccctggtcaagtgagcatc 

916 ttctggctggctgtctcccctcactatgaaaaccccaaacttctaccaataacgggattt 

97 6 gggttctgttatgataactgtgacacacacacacactcacactctgataaaagagatgga 

103 6 gacactaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
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